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Method for Comparing Features Extracted from Images of Fingerprints 
Field of the Invention 

The present invention relates generally to comparing features extracted from images, 
and more particularly to comparing unknown fingerprints with known fingerprints. 

Background of the Invention 

Images of fingerprints are routinely used to identity individuals. Because each 
person's fingerprints are unique, images of unknown fingerprints can be compared 
with images of known fingerprints stored in a database. For example, the database of 
fingerprints in the FBI archives includes over 30 million ten-print "cards." When a 
matching set of fingerprints is found in the database, the identity of the person can be 
verified. Standard image formats have been adopted for recording and storing images 
of a person's fingerprints along with additional information such as: names, alias 
names, birth date, height, weight, hair color, eye color, race, and so forth. 

Prior art fingerprint verification systems typically include techniques for locating, 
classifying, and identifying key features of fingerprints such as pattern type, ridge 
features and direction. Ridge features are defined by bifurcations and endings of 
ridge flows on a person's fingers. These minutiae include cores, deltas, whorls, 
loops, arches, tented arches, and the like. The minutiae data can include (x, y) 
coordinates of their locations and degree of orientations (q). 

The well known Henry system is the predominant manual system used by law 
enforcement agencies for fingerprint identification. However, the Henry system uses 
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a relatively small number of classifying characteristics. Hence the total number of 
identifier codes is too small to uniquely account for the millions of fingerprints in a 
comprehensive database. 

Generally, three types of automated comparison methods are used for searching a 
fingerprint database. The first method uses a ten-print-to-ten-print comparison where 
each finger and its orientation is known. There, the set of prints is complete, and the 
quality of the images is good. The result of the first method is usually conclusive. 
The second method uses a latent-to-ten-print comparison. The latent print can be 
"lifted" from some arbitrary surface through a variety of known techniques. In 
contrast with the ten-print method, latent prints are usually partial and of poor 
quality. Often, the finger numbers and their orientations are also unknown. The third 
method uses a latent-to-latent comparison to determine if two separately obtained 
prints belong to the same person, even though the exact identity of the person may be 
unknown. 

Generally, automated comparison is performed by first aligning an unknown "test" 
print with a known or "reference" print. Then, the relative spatial and angular 
position of comparable minutiae in the two prints are superimposed, evaluated and 
scored according to the number of minutia that are common to the two prints. The 
method completes when the unknown print has been compared with all closely 
matching known prints. A high score indicated a larger number of common minutia 
and a probable match. Typically, each pair of matching minutiae increases the score 
by one. Some typical prior matching methods are described in National Bureau of 
Standards (NBS) Technical Notes 538 and 878, and NBS Special Publication 500-89. 
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Prior art fingerprint comparison systems only work well with uniform, high quality 
images of a ten-print set. However, images of latent prints are often partial and low 
contrast, making the systems unreliable and inconsistent. In addition, fingerprint 
artifacts such as cuts, scrapes, abrasions, and scars can lead to "false" minutiae, such 
5 as breaks, islands, short branches, lakes, and joins. False minutiae cause 

identification failures and necessitate operator intervention, which increases cost and 
reduces throughput. 

Sasakawa et al.,. in "Personal Verification System with High Tolerance of Poor 
10 Quality Images;' SPIE Vol. 1 386, Machine Vision Systems, pp. 265-272, 1990. 
M describe a fingerprint verification system (FVS) that uses image enhancement 
13 techniques, such as a directional spatial filter and local thresholding, to extract 
y minutiae data. They use both coarse and fine matching for minutiae data, see also, 
q U.S. Patent No. 6,229,922 "Method and Apparatus for Comparing Incoming Data 
Jl5 with Registered Data" issued to Sasakawa et al. on May 8, 2001 . Their similarity 
g score is based on a normalized integer count of the number of matching minutiae. 
% Their method scores all matching minutiae equally. That can lead to false 

acceptances and false rejections, particularly when similar minutiae are located near 

each other. 

20 

Therefore, there is a need for a fingerprint comparison method that can better 
discriminate features in images. 

Summary of the Invention 

25 

According to the invented method, features are extracted from a test and reference 
image to generate a test and reference record. In one application of the invention, the 
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images are of fingerprints, and the features are fingerprint minutiae. The images can 
be enhanced before feature extraction. A large number of reference records can be 
stored in a database for fingerprint identification. In this case, the test record is 
compared with each of the reference records. 

5 

Each feature has a location, and orientation, and furthermore, the features of the 
reference records also have associated weights, which are precomputed. First, the 
features of the test record are approximately aligned with the features of the 
reference record. The alignment can be a rigid transformation using global XY 
10 translation, and rotation. 

13 Then, differences between the locations and orientations of the features of the 

■SI reference record and the features of the test record are measured. The weights of all 

*#* 

P features of the reference record that are less than a predetermined difference when 
Tl5 compared with the features of the test record are summed to determine a similarity 

i s 

q score that the test record matches the reference record. 



few 

u 



In one aspect of the invention, the features are represented using a probabilistic 
density function and the transformation only uses global translation to approximately 
20 align the records. 

Brief Description of the Drawings 

Figure 1 is a block diagram of a method for comparing image features according to 
25 the invention; 

Figure 2 is a diagram of extracted features showing their locations and orientations; 
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Figure 3a-b are diagrams of aligned features; and 

Figure 4a-b are diagrams of weighted features according to the invention. 

5 

Detailed Description of the Preferred Embodiment 

Method Overview and Data Structures 

10 Figure 1 shows a method 100 for comparing features of a test record 101 with 
g reference records 102 stored in a database 150. More particularly, the test record 101 

includes minutiae data of an unknown fingerprint, and the reference records 102 
k >£ include minutiae data of known fingerprints. 

« 15 In a practical application, the purpose of the method 100 is to determine to what 
h extent the unknown fingerprint is similar to any of the known fingerprints. As shown 
0 in Figure 1, the method 100 according to the invention includes three basic steps, 
p feature alignment 110, difference measuring 120, and weighting 130 to produce a list 
103 of scored candidate reference records that most likely match the test record 101. 
20 The invention also determines 140 weights 131 for the features of the reference 
records in a preprocessing step. 

As shown in Figure 2, the test record (T) 101 represents the features (minutiae) 200 
of the unknown fingerprint as M vectors {xj, y j9 qj} ^ . For example, the jth feature 

25 201 has coordinates x and y 202 of its location, and a direction q 203 of its 

orientation, where M is the number of extracted features in the record, twenty-nine in 
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this example. Similarly, the features of each reference record (R) 102 stored in the 
database 150 is represented by N vectors {x h y h * . It should be noted that the 
number of features in the test and reference records do not need to be identical. 

5 As described below in greater detail, the features of the reference records have 
precomputed associated weights, and all features can be represented by a 
probabilistic density model. 

The features 200 can be extracted from an image of an actual fingerprint, or a latent 
10 print using conventional techniques, for example, see Sasakawa et al., in ''Personal 
jJJ Verification System with High Tolerance of Poor Quality Images" SPIE Vol. 1386, 
| Machine Vision Systems, pp. 265-272, 1990, and U.S. Patent No. 6,229,922 
2 "Method and Apparatus for Comparing Incoming Data with Registered Data" issued 

0 to Sasakawa et al. on May 8, 200, both incorporated herein by reference. 

S3 
f 15 

p Method Operation 

p During operation of the method 100, the test record 101 is compared with each of the 

reference records 102 to produce the list 103 of likely candidate reference records. 
20 The test record is first aligned 110 with each reference record. The alignment is a 
rigid transformation that can include global XY translation and rotation. Because 
fingerprint images are generally standard sizes, scaling is usually not necessary. 

The alignment 110 can use the Sasakawa ridge-direction displacement (dX, dY), 
25 referenced above, to bring the (x, y) components of the features of the test record 101 
into approximate alignment with the features of each reference record 102. 
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Then, a similarity score S of the M features in the test record T matching any of the 
features in the reference record R is determined by the total average: 

S(T, R) = — - — (#of matches) w, 
N + M 

which when expanded in terms of R yields 

9 M N 

(/V +M ) j=\ ;=1 

where w is the weight, and/ is a function that measures the difference between the 
x, y, and q components of the test and reference features. The function /(.) can be a 
discrete step function with values {0,1 } indicating either no match or a match, or a 
continuous smooth function in the range [0.1] based on a probability of matching. 

If all of the differences for a particular pair of features are less than predetermined 
thresholds, the features are considered matching, for example, \x r x,l< T x , and \yj - y t \ 
and < 7>, \q r qi\ < T Q . If the number of matching features is large, e.g., the similarity 
H score S > T MAX , the test record likely matches the reference record. It should be 
145 noted, that the search space for matching pairs of features can be restricted to only 
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f s. 



those pairs of features that are approximately aligned according to their x, y, and q 
values. 



Relevance Weights 

20 

In order to improve the discrimination, an set of relevance weights w,- 131 is provided 
for each reference record 102. The weights in a particular set, unlike the prior art, are 
not necessarily equal. This is based on the assumption that not all features should 
count equally when contributing to the similarity score. 

25 
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To illustrate this notion, consider the features 1-29 shown in Figure 3a and Figure 3b. 
In Figure 3a, five acceptable matches (16, 19, 22, 24, 26) from a test record 301 are 
shown with open circles, all other features are indicated by closed circles. Similarly, 
figure 3b shows a case where there are also exactly five matches (1, 2, 4, 6, 21), but 
5 now to different features in a reference record 302. The question is: which of these 
two matches is more reliable or significant, given that they both have the identical 
similarity score value of five. 

The matches in Figure 3a are from tightly clustered regions 310 of features which 
10 have similar orientations. This means that matches in these regions have "support" 
from several neighboring features, all of which can contribute their share to the 

P 

0 similarity score because they all partially "explain" the observed data in the test 
HI record. 

rl5 On the other hand, in Figure 3b, matches of features 1 and 2 are relatively "isolated" 
q region 320 where the chance of an accidental alignment is less. Likewise a match to 

j ,. 

feature 6 is less likely to be due to chance since its nearest neighbor feature 4 has the 
complete opposite orientation. Similarly, feature 21 is surrounded by opposite 
orientation features, and is hence also "isolated." 

20 

Therefore, one can posit that unique or "isolated" pairs of features are much harder to 
match, and whenever an acceptable match is found for a pair of isolated features, the 
similarity score should count more than when a match which can be "explained" by 
multiple surrounding features with partial overlap. Consequently, the invention 
25 makes a feature weight w, proportional to its uniqueness or distance with respect to 
its nearest neighboring features. 
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Therefore, the invention identifies a local neighborhood of the k nearest neighbor 
features, and the set the weight proportional to a function of these k distances: 

Wi= N—t; , — — 

Y,%D{d),d),...,d]) 

where d" is the nth distance of feature i. Note, that these weights are normalized to 
sum to N. 



Any number of distance functions D can be used to measure the distance from a 
particular feature to its neighboring features including the arithmetic mean 

D 0 (d 1 ,d 2 ,...,d k )=jid\ 

k n=l 

the geometric mean 

Djid 1 , d 2 , ... , d k ) = 
and the maximum D 2 which is simply dt. 




In all three cases, the distances are determined for the x, y, and q components of 
each feature in each reference record to derive the corresponding sets of weights w ( 
131. The sets of weights need only be determined 140 once for each reference 
record, perhaps in an off-line preprocessing step. For completeness, the reference 
record (R) 102 could be represented by the vector {x„ y it q h wj J, . The weights w, in 
a particular reference record can be normalized to sum to one. Then, when there is a 
match between two features, the weighted similarity score S 132 is incremented by 
the corresponding weight, instead of by one as in the prior art. 



Probabilistic Feature Matching 
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As stated above, the alignment step 110 performs translational and rotational rigid 
transformations. However, by using a probabilistic model, as described in greater 
detail below, the alignment can be less precise, and the rotational transformation step 
5 can be eliminated, decreasing the number of computations. 

According to a kernel density model of the invention, the reference records are built 
using radial Gaussian functions: 

Pr{x, y, q) = - x-a 2 x )f(y - y^f^q - q t \a\) 



MlO where 

& 



i 

The function f(0; c 2 ) is a standard zero-mean Gaussian with a variance a 2 , e.g.: 
/(x;cr 2 ) = e-* 2 ' 2 *\ 



p The function /^(0; cr 2 ) is also Gaussian, except that in subtracting q t from q, the 

boundary conditions at 0° and 360° are handled properly, e.g., 5°and 355° are 
considered to be ten degrees apart. 



20 The similarity score S is now determined by: 

2 M 

s(T ' R)= ^¥° (x " y " q ' h 

which when expanded in terms of P R yields 
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S(T, F) = 



9 M N 

(N + M) j-\ ,=, 



M i\ 

S 5>,/(*, -x i \a 2 x )f{y j - y i a 2 y )f q {q j -<?,;<), 



and the distances are determined in a manner similar to log-probabilities, in terms of 
the individual a terms for (x, v, q). In other words, the mutual distance between 
features i and j are given by a normalized (balanced) L p norm 



d = 



(x.-XjY (yr-xyjY + (q t - qj )> 



-\\ip 



a" a" 
where for a Euclidean metric, p = 2. 



Note that in subtracting orientations q, the boundary condition and "wrap-around"' at 
p 0 and 360 degrees must again be properly handled as with determining the modified 

§io Gaussian^ in P R . The variances <7 w in (x, y, q) are needed to account for independent 

Si 

$ weighting of distances, and also to account for the range differences between (x, v) 

0 and q. 



U Figures 4a-b show the "relevance" weighting under two different conditions or 
ft 5 "relevance" assumptions. Figure 4a shows these weights as vertical lines for a case 
where (x, y) are equally weighted but a is intentionally set large to disable the 

effect of orientation similarity. This results in a set of weights which only take into 
account the spatial geometry of the features, i.e., the (x, y) k neighborhood structure. 
For example, in Figure 4a features 1 and 2 have the largest weights due to their 
20 spatial isolation and features 4 and 6 have relatively low weights given their 
proximity. In contrast, Figure 4b shows a more balanced set of a w values which 

incorporate orientation differences as well, refer back to Figures 3a-b to see this 
better. 
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Notice that in Figure 4b features 1 and 2 no longer have the maximal weights, 
instead feature 6 has the largest weight due to the fact that there are no minutiae of 
similar orientation in its proximity. Compare this with the XY-only weighting 
scheme in Figure 4a in which both features 6 and 4 are de-emphasized simply 
5 because of their XY proximity. The relative <j w values that optimize performance 

are a compromise with an intermediate value of a . Appropriate values of all the 

above parameters which optimized the performance of the method according to the 
invention are given below. 



10 Tuning and Results 

10 For best performance, several parameters can be "fine-tuned" for a specific 

H 

4p application. Foremost are the kernel parameters used in the expansion P R , mainly the 

jgj variances of x, y, q and w. Then the best-performing kernel parameters for the 

jj.5 Gaussian mixture, in terms of error rates, and best trade-off between spatial (XY- 

\1 only) weighting, and combined spatial-orientation weights, can be selected. 

^ The probabilistic matching technique, with only global XY translation, can reduce 
the false acceptance error rate by up to a factor of six, for the same false rejection 
20 rate, with significantly less computation because the costly fine-alignment search as 
done by the prior art Sasakawa method is eliminated. 



Although the invention has been described by way of examples of preferred 
embodiments, it is to be understood that various other adaptations and modifications 
25 may be made within the spirit and scope of the invention. Therefore, it is the object 
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of the appended claims to cover all such variations and modifications as come within 
the true spirit and scope of the invention. 
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